Item calibration in incomplete testing designs
Abstract
This study discusses the justifiability of item parameter estimation in incomplete testing designs in item response theory. Marginal maximum likelihood (MML) and conditional maximum likelihood (CML) procedures are considered in three commonly used incomplete designs: random incomplete, multistage testing, and targeted testing designs. Mislevy and Sheehan (1989) have shown that in incomplete designs the justifiability of MML can be deduced from Rubin's (1976) general theory on inference in the presence of missing data. Their results are recapitulated and extended to more situations. It is shown that for CML estimation the justification must be established in an alternative way, by considering the neglected part of the complete likelihood. The problems with incomplete designs are not generally recognized in practical situations, because standard computer algorithms do not take the stochastic nature of these designs into account. For that reason, incorrect uses of standard MML and CML algorithms are discussed.
Similar resources
Application of Optimal Designs to Item Calibration
In computerized adaptive testing (CAT), examinees are presented with various sets of items chosen from a precalibrated item pool. Consequently, the attrition speed of the items is extremely fast, and replenishing the item pool is essential. Therefore, item calibration has become a crucial concern in maintaining item banks. In this study, a two-parameter logistic model is used. We applied optima...
Reducing the length of mental health instruments through structurally incomplete designs.
This paper presents structurally incomplete designs as an approach to reduce the length of mental health tests. In structurally incomplete test designs, respondents only fill out a subset of the total item set. The scores on the unadministered items are estimated using methods for missing data. As an illustration, structurally incomplete test designs recording, respectively, two thirds, one hal...
An Automatic Online Calibration Design in Adaptive Testing
An accurately calibrated item bank is essential for a valid computerized adaptive test. However, in some settings, such as occupational testing, there is limited access to examinees for calibration. As a result of the limited access to possible examinees, collecting data to accurately calibrate an item bank in an occupational setting is usually difficult. In such a setting, the item bank can be...
Optimal Design for Count Data with Binary Predictors in Item Response Theory
The Rasch Poisson counts model (RPCM) allows for the analysis of mental speed which represents a basic component of human intelligence. An extended version of the RPCM, which incorporates covariates in order to explain the difficulty, provides a means for modern rule-based item generation. After a short introduction into the extended RPCM we will develop locally D-optimal calibration designs fo...
On the complementarity of classical test theory and item response models: item difficulty estimates and computerized adaptive testing
This study aims to provide statistical evidence of the complementarity between classical test theory and item response models for certain educational assessment purposes. Such complementarity might support, at a reduced cost, future development of innovative procedures for item calibration in adaptive testing. Classical test theory and the generalized partial credit model are applied to tests c...
Publication date: 2010